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1. (Amended) A process for generating a library of oligonucleotides that are specific for a given 
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set of nucleic acids, comprising: 

a) generating random oligonucleotides, wherein said oligonucleotides are of a 
uniform length comprising a single stranded, central segment of randomly varied bases 
and flanking segments of defined sequences on each side of said central segment; 

b) hybridizing the random oligonucleotides with a nucleic acid-containing template 
of biological or synthetic origin under hybridization conditions that enable the formation 
of duplexes and using blockers to avoid hybridization of said flanking segments; 

c) eliminating non-specific duplexes using conditions that minimize or abrogate 
mismatches; 

d) separating the hybridized oligonucleotides from the duplexes obtained in step c); 
and 

e) amplifying the hybridized oligonucleotides. 

2. (Amended) A process as defined in claim 1, further comprising subtracting between two 
different oligonucleotide libraries (OLl and OL2) which contain similar sequence motifs. 

3. (Amended) A process as defined in claim 2, wherein said subtracting comprises: 

a) generating single stranded versions of OLl and OL2; 

b) annealing the OLl strands with an excess of OL2 strands, under 
hybridization conditions; 

c) partitioning double stranded hybrids (OLl:OL2) and single stranded OL2 
from single stranded OLl; 

d) amplifying the single stranded OLl; and 

e) repeating steps a) to d) to obtain OLl oligonucleotides with reduced 
affinity for OL2. 
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4. (Amended) A process as defined in claim 1, wherein said central segment comprises 10-40 
bases and each one of said flanking segments comprises 10-40 bases. 

6. (Amended) A process as defined in claim 1, wherein the nucleic acid-containing template 
comprises at least one of genomic or synthetic DNA or RNA, or cDNA. 

8. (Amended) A library of oligonucleotides produced by the process comprising: 

a) generating random oligonucleotides, wherein said oligonucleotides are of a 
uniform length comprising a single stranded, central segment of randomly varied bases 
and flanking segments of defined sequences on each side of said central segment; 

b) hybridizing the random oligonucleotides with a nucleic acid-containing template 
of biological or synthetic origin under hybridization conditions that enable the formation 
of duplexes and using blockers to avoid hybridization of said flanking segments; 

c) eliminating non-specific duplexes using conditions that minimize or abrogate 
mismatches; 

d) separating the hybridized oligonucleotides from the duplexes obtained in step c); 
and 

e) amplifying the hybridized oligonucleotides. 

9. (Amended) Use of a library of oligonucleotides produced by the process of claim 1 in a 
diagnostic kit. 

10. (Amended) Use of a library of oligonucleotides produced by the process of claim 1 to inhibit 
gene function. 
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11. (Amended) A method of diagnosis comprising use of a library of oligonucleotides produced 
by the process of claim 1 . 

12. (Amended) Use of a library of oligonucleotides produced by the process of claim 1 wherein 
said oligonucleotides are bound to a solid support. 

13. (Amended) A use as defined in claim 12, wherein the solid support is at least one of a 
membrane, glass slide, coated glass slide, printed arrays, microspheres or chromatographic 
media. 

14. (Amended) Use of a library of oligonucleotides produced by the process of claim 1, wherein 
said oligonucleotides are hybridized to nucleic acid arrays. 
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STRIKETHROUGH VERSION OF AMENDED CLAIMS 

1. A process for generating a library of oligonucleotides that are specific for a given set of 
nucleic acids, comprising: 

a) generating random oligonucleotides, wherein said oligonucleotides are of a 
uniform length comprising a single stranded, central segment of randomly varied bases 
and flanking segments of defined sequences on each side of said central segment; 

b) hybridizing the random oligonucleotides [of step a)] with a nucleic acid- 
containing template of biological or synthetic origin under hybridization conditions that 
enable the formation of duplexes and using blockers to avoid hybridization of said 
flanking segments; 

c) eliminating non-specific duplexes [formed in step b)] using conditions that 
minimize or abrogate mismatches; 

d) separating the hybridized oligonucleotides from the duplexes obtained in [step] 
c); and 

e) amplifying the hybridized oligonucleotides [obtained in step d)]. 

2. A process as defined in claim 1, further comprising [the step of f)] subtracting between 
two different oligonucleotide libraries (OLl and OL2) which contain similar sequence motifs. 

3. A process as defined in claim 2, wherein said subtracting [in step f) consists in] 
comprises : 

a) generating single stranded versions of OLl and OL2; 

b) annealing the OLl strands with an excess of OL2 strands, under 
hybridization conditions; 

c) partitioning double stranded hybrids (OLl:OL2) and single stranded OL2 
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from single stranded OLl; 

d) amplifying the single stranded OLl [obtained from step c)]; and 

e) repeating [steps] a) to d) to obtain OLl oligonucleotides with reduced 
affinity for OL2. 

4. A process as defined in [any one of claims 1 to 3] claim 1 , wherein said central segment 
comprises 10-40 bases and each one of said flanking segments comprises 10-40 bases. 

6. A process as defined in [any one of claims 1 to 3] claim 1 , wherein the [template of step 
b) contains] nucleic acid-containing template comprises at least one of genomic or synthetic 
DNA or RNA, or cDNA. 

8. A library of oligonucleotides produced by the process [of any one of claims 1 to 7] 
comprising: 

a) jgenerating random oligonucleotides, wherein said oligonucleotides are of a 
uniform length comprising a single stranded, central segment of randomly varied bases 
and flanking segments of defined sequences on each side of said central segment; 

b) hybridizing the random oligonucleotides with a nucleic acid-containing template 
of biological or synthetic origin under hybridization conditions that enable the formation 
of duplexes and using blockers to avoid hybridization of said flanking segments: 

c) eliminating non-specific duplexes using conditions that minimize or abrogate 

mismatches; 

d) separating the hybridized oligonucleotides from the duplexes obtained in step c): 
and 
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e) amplifying the hybridized oligonucleotides. 

9. Use of a library of oligonucleotides produced by the process of [any one of claims 1 to 7] 
claim 1 in a diagnostic kit. 

10. Use of a library of oligonucleotides produced by the process of [any one of claims 1 to 7] 
claim 1 to inhibit gene function. 

11. A method of diagnosis comprising use of a library of oligonucleotides produced by the 
process of [any one of claims 1 to 7] claim 1 . 

12. Use of a library of oligonucleotides produced by the process of [any one of claims 1 to 7] 
claim 1 wherein said oligonucleotides are bound to a solid support. 

13. A use as defined in claim 12, wherein the solid support is at least one of a membrane, 
glass slide, coated glass slide, printed arrays, microspheres or chromatographic media. 

14. Use of a library of oligonucleotides produced by the process of [any one of claims 1 to 7] 
claim 1 , wherein said oligonucleotides are hybridized to nucleic acid arrays. 
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TITLE OF THE INVENTION 

Process for the Generation of Oligonucleotide Libraries (OLs) 
Representative of Genonnes or Expressed mRNAs (cDNAs) and Uses 
Thereof 

FIELD OF THE INVENTION 

The present invention relates to a process for the generation 
oligonucleotide librairies (OLs) representative of genomes or expressed 
mRNAs (cDNAs) and to the uses thereof. In particular, the present 
invention relates to a process for the generation of oligonucleotide 
librairies comprising oligonucleotides of uniform length. The present 
invention further relates to the uses of these OLs in numerous 
biotechnoiogical applications, including the identification and/or 
characterization of biological materials, clinical diagnosis (DNA/RNA 
level), preparative extraction of specific mRNA (and genes) and genomic 
research/mapping, 

BACKGROUND OF THE INVENTION 

The generation of genomic DMA libraries, or cDNA libraries and the 
maintenance, and handling of these libraries are critical procedures in the 
field of genomics and/or biotechnology. In classical libraries the relevant 
segments of DNA are cloned into vectors, which are maintained and 
propagated in particular biological systems (tn vivo). Alternatively, libraries 
{in vitro) can be directly constructed from genomic DNA or cDNA. They 
contain linkers at the 5' and 3' ends of the DNA which allow PGR 
amplification of the library. The information stored in these libraries 
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contains repetitive sequence elennents that originated from repetitive 
DNA, or high copy mRNAs. This results in a significant redundancy, which 
can complicate the use and the outcome of using classical libraries. 
Another important feature which reduces the utility of classical libraries is 
the heterogeneity in size of the members comprising the library. This 
limits the usefulness of classical libraries in subtractive hybridization 
procedures (1-2) which are dependent upon the length, complexity and 
the redundancy of the libraries, and which therefore are particularly 
sensitive to the choice of method and the number of cycles performed. 
In fact, one must tailor the hybridization conditions to accommodate the 
heterogeneous length and redundancy of stored information in order to 
perform subtraction. Thus, the results are more "laboratory-specific" than 
library-specific. 

A number of diagnostic methods that involve nucleic acid hybridization 
have arisen in recent years. Most of them are designed to provide 
qualitative information about the presence of a specific sequence motif in 
a complex analytical mixture of nucleic acids and use a detection system 
based on PGR and/or DNA chip hybridization technologies (3-7). For both 
of these technologies, diagnostic oligonucleotides constitute an essential 
part of the detection system. These oligonucleotides are primarily chosen 
based on the sequence data of the nucleic acids to be detected, in spite 
of the power of hybridization to correctly identify a complementary strand, 
it does face limitations. In fact, the difference in stability between a 
perfectly matched complement and a complement mismatched at only 
one base can be as little as 0.5° C (8). This is the fundamental limitation 
to the power of DNA hybridization for specific identification of a cognate 
strand. Therefore, the diagnostic power of any chosen oligonucleotide 
must be validated using an analytical mixture whose sequence context is 
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not totally known. The problem of adequate probe selection is time and 
labour-consuming. On the other hand, the growing complexity of 
detection systems based on oligonucleotide technologies requires a fast 
selection of a large number of short oligonucleotides. 

5 Akopyants et al (7) performed subtractive hybridization using bacterial 
DMAs digested by high-frequency restriction enzymes. The use of such 
restriction enzymes tends to generate DNA fragments having a broadly 
similar size, about 500 base pairs. However, the uniformity is not 
rigorous. Moreover, the library created by these restriction fragments still 
10 contains a significant number of redundant sequences; consequently, 
patches of short polymorphism embedded in homologous sequences are 
going to be missed when such a library is used. 

U.S. Patent No. 5,270,163 (8) teaches a method for the isolation of 
nucleic acids using high-affinity nucleic acid ligands. This method has 
been termed the SELEX method (Systematic Evolution of Ligands by 
Experimental Enrichment) and is based on the use of proteins or small 
molecules, but not nucleic acids, as targets. The selection of 
oligonucleotides in the SELEX method relies on the three-dimensional 
(3D) shape of the oligonucleotides and their fit into the structures of the 
target molecules. In contrast to this, the selection of oligonucleotides in 
the present invention is based on hybridization with target nucleic acid. 
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Armour et al (11) describes the quantitative recovery of amplifiable 
25 probes hybridised to an immobilised target. The amplifiable probes 
consist of PGR or restriction fragments and their technique is meant to 
assess the copy number of loci. 
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There thus remains a need for oligonucleotide libraries which allow for the 
use of uniform hybridization conditions to perform selection and/or 
subtraction while minimizing or eliminating redundant sequences. 
Advantageously, these libraries can be used in the selection of highly 
5 informative and target-specific probe libraries. The present invention 
seeks to meet these and other needs. 

SUMMARY OF THE INVENTION 

10 The procedure described herein results in the generation and selection 
of oligonucleotide probes with a high specificity for a given system. These 
oligonucleotides cover the entire length of the target DNA, thus increasing 
detectability which might be lost in classical oligo-detection systems due 
to secondary DNA structure or DMA deletions present in an analyte 

15 mixture. At the same time, they present inexpensive variants of a 
multiplex oligonucleotide-detection approach, since they are not required 
to be individually synthesized. 

More specifically, in accordance with the present invention, there is 
20 provided a process for the generation of oligonucleotide libraries, or OLs. 
The present invention teaches a process for generating OLs from 
genomic DMAs and cDNAs, and for performing the subtraction of these 
libraries. 

25 The present invention further teaches OLs which allow the use of 
hybridization conditions which are controllable and reproducible. In 
addition, the invention teaches a process for the selection of uniform 
length OLs which minimizes or eliminates redundant sequences and 
reduces complexity. The result is the production of highly-informative and 

30 target-specific probe libraries. 
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An object of the present invention is therefore to provide a process for the 
generation of oligonucleotide libraries comprising OLs of uniform length 
which are self-amplifiable and easily subjected to subtraction. 

Another object of the invention is to provide OLs which are compatible 
with DNA array technology. Indeed, an array of diverse mixtures of 
oligonucleotides which show differential hybridization patterns could be 
the best choice for the next generation of DNA diagnostics. 

Other objects, advantages and features of the present invention will 
become more apparent upon reading of the following non-restrictive 
description of preferred embodiments thereof, given by way of example 
only with reference to the accompanying drawings. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 : Schematic representation of the experimental procedure for 
the preparation of OL. Denatured DNA is bound to the membrane and 
hybridized to the random oligonucleotides library in the presence of 
blockers. Theses blocking primers disallow the unspecific hybridization 
of left and right oligonucleotide arms used for PGR amplification of the 
OL. (ss - single-stranded DNA, ds - double-stranded DNA). 
Hybridization and PGR amplification of OL are described more 
particular below, in the Experimental Methods. 

Figure 2 : Dot blot hybridization of OL targeted against different genomes. 
The first row represents the dot blot hybridization of random probes with 
the specified genomic DNA (adenovirus, pBluescript and lambda). The 
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last row shows dot blot hybridization of mixed adenovirus and lambda- 
selected OL. The other rows are analytical dot blot hybridizations of 
selected OLs with each of the genomes indicated. The procedures of 
preparative and analytical hybridization are described in the Experimental 
5 Methods, below. 



Figure 3 : Specificity and probe distribution of OL generated from 
adenoviral genome. (A) The corresponding genome and adenoviral DNA 
were run on a 1% agarose gel stained with ethidium bromide. The type 

10 of restriction enzyme and DNA are indicated on the top of each gel lane. 
(B) Southern hybridization of the same gel using adenovirus OL as an 
hybridization probe (see Figure 2, row 2). It should be noted that under 
the experimental conditions, there was no cross-hybridization with either 
lambda or human DNA. (C) The same membrane was stripped and 

15 rehybridized with a OL directed against a 3648 bp-long restriction 
fragment. This subset of adenovirus OL was prepared by cutting the 
membrane corresponding to the 3648 bp band from a similar southern 
blot and reamplified by PGR as described in the Experimental Methods, 
below. Thus, it is shown that OL specificity may be enhanced by 

20 controlling the choice of targeted DNA fragments in the next round of 
selection. 

Figure 4 : The distribution of OL along genomic DNA. The densitrometric 
scan of radioactive signal from OL was integrated over total adenoviral 
25 genome (Figure 3, lanes 3 and 5) using Scion Image software (Scion 
corporation, Frederick, Maryland). The signal intensity of OL probes 
hybridizing to restriction fragments is linearly proportional to the length of 
DNA, 
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Figure 5 : Subtractive enrichment of OL. (A) The tester OL is presented 
by the nnixture of two genomes (Adenovirus type 2 and Lambda phage). 
The driver OL was produced from the Lambda genome only. The single 
stranded (ss) OL from driver DNA was used to pool out the 
5 complementary single stranded mixed tester OL. After removing the 
subtracted fraction, the remainder of the mixed OL was used as a probe 
in the analytical hybridization step. The mixed OL probes were analyzed 
by dot blotting as described (B) before subtractive enrichment and (C) 
after subtractive enrichment by hybridization to genomic Adenovirus and 
10 Lambda DNA. 

Figure 6 : Relative distribution of 20-mers with the different number of 
mismatches which do hybridize to targeted DNA. The abscissa shows the 
number of mismatches present in the 20-mer, while the y-axis illustrates 

15 the corresponding relative frequencies. The distribution profile was 
obtained by calculating the number of combinations for each particular 
number of mismatches which are thermostable at 52° C, The y-axis was 
normalized to reflect the relative distribution (%) over the total number of 
captured oligonucleotides (100%). The majority of n-mers captured after 

20 the first round of selection will be 20-mers with less then 6 mismatches. 
This is described further in the Detailed Description, below. 

DETAILED DESCRIPTION 

25 The present invention thus provides a process for the generation of 
oligonucleotide libraries having the following characteristics: 

1) A uniform length of about 60 bases, comprising a central segment of 
about 20 bases randomly varied to represent all possible combinations. 
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and segments of about 20 bases of a defined sequence flanking the 
central segment on each side; 

2) A uniform number of copies for each sequence motif (consequently, 
there are no differential hybridization kinetics which could originate from 

5 the presence of repetitive DNA); and 

3) A melting profile which is characterized by a sharp transition from 
double stranded to single stranded (or vice versa) oligonucleotides. This 
is a critical advantage in subtractive hybridization procedures. 

1 0 The use of these OLs enhances the specificity of hybridization to nucleic 
acids isolated from various sources, thereby allowing for the preparation 
of oligonucleotide mixtures useful in the detection and quantification of 
specific nucleic acids or nucleic acid mixtures. 

15 In one particular embodiment, the starting pool of oligonucleotides is 
chemically synthesized and consists of a random region of a fixed length 
(L), flanked by a constant sequence (primer binding sites, PBS). The 
random oligonucleotide pool covers n copies (n=1,2,3...) of all sequence 
combinations of length L, i.e. 4^, which is a total of 10^^ different sequence 

20 motifs for L==20 nucleotides. The basic length of oligonucleotides is long 
enough to generate uniform sequence motifs for a particular biological 
system. The complexity of the library (10^^) overcomes the complexity of 
the template (which is usually between lO'^-IO^). The random pool is then 
hybridized with a nucleic acid template isolated from any selected source 

25 and the unbound oligonucleotides are washed away under stringent 
conditions. The remaining, template-bound oligonucleotides are then 
subjected to amplification, using PGR or other methods known to those 
of skill in the art and using primers complementary to the constant 
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flanking segments, thus producing a library of oligonucleotides capable 
of selectively hybridizing to nucleic acid templates. 

The choice of 20-mer for the length of the oligonucleotide library is not 
arbitrary but is based on the rationale that the length of the particular 
sequence motif should be long enough to be unique for even the most 
complex genomes. The L-mer of length L will be, on average, repeated 
every 4 power L base pairs. The longer the L, the greater the average 
distance between 2 identical sequence motifs of length L will be. 
However, this particular combinatorial approach is at best approximative, 
and other lengths may be suitable as well. The choice of length will 
depend on such factors as the length and/or complexity of the genome 
to be detected and compatibility with current nucleic acid amplification 
and DNA array technologies. 

In another embodiment, the amplification described above is performed 
with one of the PGR primers being labelled with biotin, providing means 
for purification of the labelled products with streptavidin-labelled 
substrates (12) or other similar methods. The amplified mixture of 
unlabelled oligonucleotides specific to one template is hybridized with 
labelled mixtures of oligonucleotides selected for specificity to one or 
several nucleic acid templates, and the unbound material is collected. In 
this manner, a mixture of nucleotides which is enriched for nucleic acids 
present in the unlabelled library only can be generated. 

The process is based on stringent hybridization. Furthermore, high 
fidelity hybridization between pools of oligonucleotides and templates 
(genomic DNA or cDNA) is the basic mode of transfer of genomic 
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information into OLs, An efficient subtractive hybridization procedure is 
used to accommodate the features of the aforementioned OLs. 

The present invention is illustrated in further detail by the following non- 
5 limiting example. 

EXAMPLE 1 

Generation of OLs, Use Thereof in Subtractive Hybridization to 
Generate Subtractive Oligonucleotides Libraries (SOLs), and Use 
10 of OLs or SOLs in Hybridization Experiments 

EXPERIMENTAL METHODS 

15 DNA / oligonucleotides 

The starting random DNA pool was synthesised by GIBCO BRL 
(Budington, Canada), (RAN), 5'-GCCTGTTGTGAGCCTCCTGTCGAA- 
N20-TTGAGCGTTTATTCTTGTCTCCC-3'. The corresponding left and 
right arms were (LEFT) 5'-GCCTGTTGTGAGCCTCCTGTCGAA-3' and 

20 (RIGHT) 5'-BioGGGAGACAAGAATAAACGCTCAA-3'. The 5'-end 
biotinylated oligonucleotides were used to pool out complement 
strands, using BioMag magnetic particles (PerSeptive Biosystems, 
Framingham, MA ). During preparative hybridization, the left and right 
arms were blocked by (LEFT) 5 -TTCGACAGGAGGCTCACAACAGGG-3' 

25 and (RIGHT) 5'GGGAGACAAGAATAAACGCTCAA-3\ Theses 
oligonucleotides are termed 'blockers' in the text 

The following genomic DNA was used to produce OL: Adenovirus DNA 
Type 2, (GIBCO BRL), Lambda DNA cl857 ind1 Sam 7 (New England 
30 Biolabs), pBluescript II SK(+) (Stratagene, San Diego, CA). The Human 
HeLa DNA used as one control was from Clontech (Palo Alto, CA). 
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Blotting genomic DNA 

The genomic DNA was denatured 2-3 minutes at 95*^ C and cooled on ice. 
The nylon membrane (Hybond-N, Amersham Pharmacia Biotech, 
5 Piscataway, NJ) was blotted with 100 ng of denatured genomic DNA, 
dried for 2 minutes on a hot plate and exposed to UV light for 8 minutes. 
The prehybridization was done for a minimum of 30 minutes in the 
hybridization buffer (7% SDS, 0-25M Na2HP04 pH7.4, 1mM EDTA, pH 
8.0 and 10g/L of BSA). 

10 Hybridization and washing of the starting random pool 

The preparative hybridization between random core (20N) and targeted 
DNA was done with 10 pmoles of starting random pool (RAN). The 
random pool was pre-mixed with 100 pmoles (10 times more than RAN) 
of LEFT and RIGHT blockers in order to exclude cross-hybridization of left 
15 and right arms with genomic DNA. The oligonucleotide mixture was 
heated up to QSX, cooled at room temperature and added to the 
hybridization buffer. The hybridization was done overnight at 50'' C. The 
first washing was done with 6X SSC, followed by subsequent 2X SSC 
washing at the same temperature as hybridization was done. 

20 

Generating OL by PGR 

The dot containing the genomic DNA and bound probes was cut out of 
the nylon membrane (radius of 2-4mm), soaked in 100 |al H2O and heated 
to 95°C for 1-2 minutes. The solution containing the denatured probe 
25 (originally RAN) was then collected and passed threw a Sephadex G-50 
column in order to eliminate salts and SDS. The PGR was prepared 
under standard conditions, typical for SELEX-like amplification of DNA 
(10, 13). The RIGHT 5'-end biotinylated primer of the sense strand (the 
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one which did not hybridise with genomic DNA) and LEFT primer of 
antisense strand were used in the PGR reaction. The temperature cycles 
were SS'^C, 72*'C, 95°C, each 30 seconds, repeated 20 times. 

Probe labelling and hybridization 

5 Before labelling, the PGR reaction mixtures were passed threw Sephadex 
G-50 columns. Around 100-200 ng of PGR product was labelled with 50 
pmols of yP^^ ATP (6000 Ci/mmol, I.G.N, pharmaceuticals, Irvine, GA). 
The total amount of probe radioactivity was 300 000 c.p.m. The probe 
was added into 0.5 ml of hybridization buffer. The blotting of genomic 
10 DNA was done as described above. Hybridizations were done overnight 
at 50°C. The nylon membrane was washed as previously described, and 
exposed to Kodak X-OMATfilm. 

OL labelling and analytical hybridization 

The generated OL was tested, using 1) the original genomic DNA from 
15 which they were selected (positive control) and 2) using the unrelated 
genomic DNA (negative control). The OL labelling, hybridization and 
probe washing was done as described, except that hybridization time was 
shorter (60 minutes). 

Southern blot hybridization 

20 Electrophoresis was performed in a 1% agarose gel with TBE buffer (80 
mM Tris borate, pH 8.0, 2mM Na2EDTA) and stained with ethidium 
bromide. One ug of BstEll-digested lambda DNA, 300 ng of adenoviral 
DNA and 1 pg of Alul-Hpal-digested human HeLa DNA were run on the 
gel according to specifications (all restriction enzymes used in this work 

25 were purchased from New England Biolabs). For Southern hybridization, 
DNA was transferred to Nylon membranes by capillary blot procedure 
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following manufacturer's recommendations (Amersham Pharmacia 
Biotech). Hybridization was performed as described above with 
adenoviral OL. Autoradiographic exposure (using Kodak X-OMAT film) 
was done at room temperature, for few hours. Stripping of the membrane 
5 was done by boiling a 1% SDS solution and pouring it over the Nylon 
membrane. 



Subtractive enrichment of OL 

The tester OL (mixed OL) that reflects the two genomes (Adenovirus type 

10 2 and Lambda) was made by preparing OL from equimolar mixtures of 2 
genomes. The driver OL was produced from the lambda genome only. 
The production of sense strand (the one which did not hybridize with 
genomic DNA) was done using 5-end biotinylated primer in PGR 
reaction. After denaturing PGR product, the biotinylated sense strand was 

15 bound to streptavidin magnetic particles (200 f^g, binding capacity > 200 
pmols of biotinylated oligonucleotides, Biomag Magnetic Particles, 
PerSeptive Biosystems), and pulled-out using a magnet The 
complementary antisense strand was discarded with the liquid phase. 
The mixed antisense tester OL (Lambda + Adenovirus DNA) was 

20 produced in the same way. This time, the supernatant with the antisense, 
non-biotinylated strand was hybridized overnight at 50°G with 10 times 
molar excess of driver Lambda sense stand attached to magnetic beads. 
The hybridization buffer was the same as described above but without 
SDS. After removing the fraction bound to the magnetic beads, the rest 

25 of the mixture was used in the analytical hybridization step. 

RESULTS 

The starting random pool of oligonucleotides contains 4^^ (i.e. 10^^) 
different 20-mers. The diversity of the sequence motifs is approximately 
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10^' higher than the diversity of the most complex genomes. A schematic 
representation of the procedure for generating OL is presented in Figure 
1 and is described in detail in the Experimental Methods, above. 

Blockers were used in order to avoid hybridization of the flanking arms to 
the targeted genome, and this step was found to be critical to achieve 
specificity. The stringency of hybridization conditions eliminates unbound 
20-mers, leaving the specific oligonucleotides bound to the membrane via 
hybridization of the random core to the genome (Fig. 1). This ensemble 
of selected oligonucleotides constitutes the OL, 

It should be noted that the starting random pool of oligonucleotides 
contained about 8 copies of each sequence motif during the first 
hybridization step (10-20 pmoies) and that the number of copies of each 
particular 20-mer present in the random mixture was smaller than the 
number of genome copies. 

Figure 2 shows that OLs are able to discriminate genomes with 
complexities around 10' to 10^ The starting random pool of probes binds 
to all three genomes equally (Fig. 2, row 1). After one round of selection, 
the OL can hybridise specifically towards a single targeted genome (Fig. 
2, rows 2, 3 and 4). The OL can be selected against a mixture of two 
genomes and the specificity is conserved for both genomes (Fig. 2, row 
5). 

A Southern blot was performed in order to document the distribution of 
adenovirus OL probes along the genome (Fig. 3). There was no apparent 
cross-hybridization of adenovirus OLto either HeLa or Lambda DNA (Fig. 
3b, lanes 1 , 4, 5 and 6). The intensity of radioactive signal over adenoviral 
genome generated by adenovirus-specific OL was linearly increasing with 
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the DNA fragments' length (Fig, 4). Therefore, one could deduce a 
uniform distribution of OL throughout the genomic DNA. 

In the next step, only the subset of the adenovirus OL bound to the 3648 
bp band in Figure 3b (rows 3 and 6) was reamplified and selected. The 
membrane was washed from the original probe and hybridised with the 
3648 bp subset of the original OL. Figure 3c shows that the specificity of 
the OL subset is obtained against the 3648 bp band. These data illustrate 
the successful increase in the specificity and the reduction in the 
complexity of the original Adeno-specific OL to that of the 3648 bp subset, 
using just one additional round of selection. 

One round of subtractive enrichment between two oligonucleotide 
libraries was performed as schematized (Fig. 5a)- The tester OL reflects 
15 the two genomes (adenovirus type 2 and lambda phage). The driver OL 
was produced from the Lambda genome only. The single stranded (ss) 
OL from the driver DNA was used to pool out the complementary single 
stranded, mixed, tester OL. After removing the subtracted fraction, the 
rest of ssDNA was used as a probe in the analytical hybridization step. 
20 The intensity of hybridization signals between Lambda and Adeno 
genomes, before (Fig. 5b) and after (Figure 5c) one round of subtractive 
enrichment was shown. It should be noted that further subtraction steps 
could be performed by changing the sequence design of flanking arms 
between tester and driver OLs, as suggested by recent developments in 
25 subtractive procedures (14). 

With reference to Figure 6, the relative distribution of 20-mers with 
different numbers of mismatches that hybridized to the targeted DNA was 
predicted. The number of 20-mers (N) with (m) number of mismatches 
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(m=0, 1, 20) capable of hybridizing to the target sequence were then 
calculated. First, the number of combinations of 20-mers (C) with the 
same number of mismatches (m) in the initial random pool of 
oligonucleotides that are capable of hybridizing to a specific 20-mer motif 
was calculated. Since each full match could be replaced by 3 different 
mismatches, the number of combinations must be multiplied by 3"" i.e. 
0*3"". Finally, these numbers were adjusted to reflect the sequence- 
dependent termostability of 20-mers with (m =0, 1,,.20) number of 
mismatches which hybridised at 52X using the thermostable fraction of 
the binomial distribution for each n-mer population. Duplexes of more 
than 7 mismatches are not observed based on this themostability 
criterion. Therefore, the majority of 20-mers captured after the first round 
of selection will harbour less than 6 mismatches. 

A process that generates amplifiable DNA oligonucleotide libraries which 
are specific for a given segment of DNA has been described. This 
process is akin to random priming, because it is possible to generate 
probes without a priori knowledge of the template sequence. One round 
of preparative hybridization was enough to produce genome-specific 
oligonucleotide libraries (Figs. 2 and 3). The OLs were inferred from 
genomes of complexity of 10^-10"^. 

The process described herein generates probes with high detection 
power. These probes/selected oligonucleotides can contain mismatches. 
The notion that introduction of artificial mismatches could increase 
detection power of oligonucleotides during single nucleotide 
polymorphism (SNP) detection was well documented by Quo et al (6). 
However, the prediction of positions and types of mismatches, which 
should be introduced to increase detectability of oligonucleotide, remains 
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undefined. Consequently, to enhance oligonucleotide detectability by 
introducing (artificial) mismatches, one must search different positions 
and types of mismatches along the oltgonucletide. Once they are 
empirically determined, i.e, tested on 2 different sequence motifs, the 
5 oligonucleotide containing particular mismatches could be used (15). 
The present process provides an approach based on differential selection 
of thermostable oligonucleotides (i.e. their differential stability), which are 
present in one, but not in the second system. The selection of 
oligonucleotides with the highest detectability is inherently present in this 
10 process, i.e. the method suggests a solution to the problem of where and 
what type of mismatches should be introduced to increase detection 
power of oligonucleotide, or to find the particular oligonucleotide which 
best discriminates between 2 sequence motifs which may differ by a 
single base. 

15 

Without wishing to be bound by any hypothesis, the following provides an 
explanation of what is believed to be occurring during the process of the 
present invention. Based on calculations, it is expected that the 20-mers 
selected in an OL can contain up to 6 mismatches. Nevertheless, 

20 specificity toward a given template was achieved, suggesting that the 
presence of these putative mismatches did not interfere with good 
discrimination. In other words, mismatch-free hybridization is not critical 
for differential detection approach; rather, the relative differences in the 
thermodynamical stabilities of the hybridized oligonucleotides appear to 

25 be determinative. The present process uses selection of oligonucleotides 
based on this criterion and therefore provides the possibility of 
overcoming current technological limitations. In the second and further 
rounds of selection, the number of 20-mers both in the targeted genome 
and the probe mixture (OL) could be adjusted. Each new round of 
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preparative hybridization (Fig.1) and/or subtraction (Fig. 6) could reduce 
the complexity of OL, by using the excess OL rather than genomic DNA, 
Therefore, the average number of mismatches for each particular 20-mer 
will continue to decline until it reaches the sequence-dependent limitation, 
5 but not the concentration-dependent limitation. 



In summary, OLs are generated from the template DMAs. These OLs are 
used in subtractive hybridization, for example between genomic or cDNA- 
based libraries (OL1 and OL2) to make a new Subtractive Oligonucleotide 

10 Library (SOL1/2 and/or SOL2/1)), that is/are specific for one 
system/library but not for the other. Oligonucleotides isolated from such 
subtractive libraries (SOL) are useful for diagnostic purposes. They can 
a) directly serve as highly specific hybridization probes or b) they can be 
tested for PCR-specific differential amplification, specific for one, but not 

1 5 the other biological system. 

These libraries (OL or SOL) can be hybridized to oligonucleotide chip 
arrays in order to obtain a specific hybridization pattern that is useful for 
diagnostic features: each OL produces an image which is specific for the 

20 templated DNA (genome or cDNA). A particular advantage in using OL or 
SOL instead of genomic/cDNA libraries is that the hybridization signal is 
not dependent on copy number and distribution of particular sequence 
motifs. By comparing images of different genomes/cDNA, one can deduce 
which oligonucleotides are highly specific for a single genome/cDNA, and 

25 use this or these oligonucleotide(s) as "genome tags". The 
oligonucleotides obtained can also be used for specific diagnostic PGR. 
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OLs or SOLs can be inferred fronn two biologically relevant systems, like 
mammalian cells, to detect fine differences in cell cycle, tissue status, viral 
infection, age/development status etc. 

5 Although the present invention has been described hereinabove by way 
of a preferred embodiment, it can be modified by one of skill in the art 
without departing from the spirit and nature of the subject invention, as 
defined more particularly in the appended claims. 
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WHAT IS CLAIMED IS: 



1. A process for generating a library of oligonucleotides that are 
specific for a given set of nucleic acids, comprising: 

5 a) generating random oligonucleotides, wherein said 

oligonucleotides are of a uniform length comprising a single- 
stranded, central segment of randomly varied bases and flanking 
segments of defined sequences on each side of said central 
segment; 

10 b) hybridizing the random oligonucleotides of step a) with a 

nucleic acid-containing template of biological or synthetic origin 
under hybridization conditions that enable the formation of 
duplexes and using blockers to avoid hybridization of said flanking 
segments; 

15 c) eliminating non-specific duplexes formed in step b) using 

conditions that minimize or abrogate mismatches; 

d) separating the hybridized oligonucleotides from the duplexes 
obtained in step c); and 

e) amplifying the oligonucleotides obtained in step d). 

20 

2. A process as defined in claim 1, further comprising the step of 

f) subtracting between two different oligonucleotide libraries 
(OL1 and OL2) which contain similar sequence motifs. 

25 3. A process as defined in claim 2, wherein said subtracting in step f) 
consists in: 

a) Generating single stranded versions of OL1 and OL2; 

b) annealing the OL1 strands with an excess of OL2 strands, under 
hybridization conditions; 
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c) partitioning double stranded hybrids (OL1;OL2) and single 
stranded OL2 from single stranded OL1 ; 

d) amplifying the single stranded OL1 obtained from step c); and 

e) repeating steps a) to d) to obtain OL1 oligonucleotides with 
5 reduced affinity for OL2. 



4. A process as defined in any one of claims 1 to 3, wherein said 
central segment comprises 10-40 bases and each one of said flanking 
segments comprises 10-40 bases. 

10 

5. A process as defined in claim 4, wherein said central segment 
comprises 20 bases and each one of said flanking segments comprises 
20 bases. 

15 6. A process as defined in any one of claims 1 to 3, wherein the 
template of step b) contains at least one of genomic or synthetic DNA or 
RNA, or cDNA. 



7. A process as defined in claim 3, wherein said partitioning is carried 
20 out using streptavidin and biotin. 

8. A library of oligonucleotides produced by the process of any one 
of claims 1 to 7. 

25 9. Use of a library of oligonucleotides produced by the process of any 
one of claims 1 to 7 in a diagnostic kit. 

10. Use of a library of oligonucleotides produced by the process of any 
one of claims 1 to 7 to inhibit gene function. 
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11. A method of diagnosis comprising use of a library of 
oligonucleotides produced by the process of any one of clainns 1 to 7. 

5 12. Use of a library of oligonucleotides produced by the process of any 
one of clainns 1 to 7 wherein said oligonucleotides are bound to a solid 
support. 

13. A use as defined in clainn 12, wherein said solid support is at least 
10 one of a membrane, glass slide, coated glass slide, printed arrays, 

microspheres or chromatographic media, 

14. Use of a library of oligonucleotides produced by the process of any 
one of claims 1 to 7, wherein said oligonucleotides are hybridized to 

15 nucleic acid arrays. 
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(57) Abstract 

A process for the generation of oligonucleotide libraries 
representative of a given template is described. Starting from a 
random pool of oligonucleotides, the process selects only those 
which hybridize to the template nucleic acid. This selection 
yields a highly specific library that represents an oligo-image 
of the chosen template. The novel quality of this approach is 
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subjected to differential selection. This technique is used to 
produce different oligonucleotide libraries (OLs) and shows 
that these OLs do not cross-hybridize. Differential selection 
of these OLs produces oligonucleotides that can be used in the 
identification, characterization and isolation of nucleic acids. 
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the subject matter which is claimed and for which a patent is sought on the invention entitled PROCESS 
FOR THE GENERATION OF OLIGONUCLEOTIDE LIBRARIES (OLs) 
REPRESENTATIVE OF GENOMES OR EXPRESSED mRNAs (cDNAs) AND USES 
THEREOF, the specification of which: 

is attached hereto. 

X was filed on July 6, 2001 as Application Serial No. 09/869,891 . 

I hereby state that 1 have reviewed and understand the contents of the above identified 
specification, including the claims, as amended by any amendment referred to above. 

I acknowledge the duty to disclose to the Patent and Trademark Office all information known to 
me to be material to patentability of the subject matter claimed in this application, as "materiality" is 
defined in Title 37, Code of Federal Regulations, § 1.56. 

I hereby claim foreign priority benefits under Title 35, United States Code, § 119 of any foreign 
application(s) for patent or inventor's certificate listed below and have also identified below any foreign 
application for patent or inventor's certificate having a filing date before that of the application on which 
priority is claimed: 

PRIOR FOREIGN APPLICATION(S) Priority 

Claimed 



2,259,745 Canada Januarv 19, 1999 Yes 

(Number) (Country) (Date Filed) 

PCT/CAOO/00047 PCT Januarv 19, 2000 Yes 

(Number) (Country) (Date Filed) 

I hereby claim the benefit under Title 35, United States Code, § 120 of any United States 
application(s) listed below and, insofar as the subject matter of each of the claims of this application is not 
disclosed in the prior United States application in the manner provided by the first paragraph of Title 35, 
United States Code, § 112, 1 acknowledge the duty to disclose all information known to me to be material 
to the patentability of the subject matter claimed in this application, as "materiality" is defined in Title 37, 
Code of Federal Regulations, § 1.56, which become available between the filing date of the prior 
application and the national or PCT international filing date of this application: 

N/A ™ 

(Application Serial No.) (Filing Date) (Status) 

N/A ^ . 

(Application Serial No.) (Filing Date) (Status) 
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PATENT 





I hereby claim the benefit under title 35, United States code § 119(e) of any United States 
provisional application(s) listed below: 



(Application Serial No.) (Filing Date) 



(Application Serial No.) (Filing Date) 



Please direct all communications as follows: 

^ric B. Mevertons 
CONLEY, ROSE &j rAYQNJP.C. 
P.O. Box 398 

Aust in, Texas 78767-0398 
F^Tn?T2y476-1400 

I hereby declare that all statements made of my own knowledge are true and that all statements 
made on information and belief are believed to be true; and further that these statements were made with 
the knowledge that willful false statements and the like so made are punishable by fine or imprisonment, or 
both, under Section 1001 of Title 18 of the United States Code and that such willful false statements may 
Jeopardize the validity of the application or any patent issued thereon. 

Inventor's Full Name: Bruno 

Inventor's Signature: 

Country of Residence: C A /W P Citizenship: (Lx>v v..^i^^^yiLA^ 

Post Office Address: 289 RandilK Chateauguav. Quebec. J6J 2P4 CANADA Cl^^f^^V^ 




(Include number, street name, city, state and zip code) 



Inventor's Full Name: a Ivan Brukner 




Inventor's Signature: jh^^^----^^ ^ Date: ( / O/ 

Country of Residence: Citizenship: 

Post Office Address: 1882 Sherbrooke East #2, Montre al,. Quebec H 2K 1B5 CANADA 

(Include numberTsffeSt name, city, state and zip code) 
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O fi -^^^1 S . J.. 




Inventor's Full Name: 
Inventor's Signature: 
Country of Residence: 



Guy Tremblay 




Date: ^\ 
Citizenship: CjN/v^-eAjX, 



Post Office Address: 3341 Marechal #4,,MQfttr£aLDuebe c H3T 1M8 CANADA _ 

(Include number, street name, city, state and zip code) 



OA- 
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PATENT 



Applicant or Patentee: 
Serial or Patent No.: 
Filed or Issued; 
For: 



Bruno Paquin. 



_ Attorney's 

Docket No.: 5593-00300/EBM 



PROCESS FOR THE GENERATION OF OLIGONUCLEOTIDE LIBRARIES (OLs) 
REPRESENTATIVE OF GENOMES OR EXPRESSED mRNAs (cDNAs) AND USES 
THEREOF 



VERIFIED STATEMENT (DECLARATION) CLAIMING SMALL ENTITY 
STATUS (37 CFR 1.9(F) AND L27(D)) - NONPROFIT ORGANIZATION 



I hereby declare that I am 

[ ] the owner of the small business concern identified below: 

[X] an official of the small business concern empowered to act on behalf of the concern identified 
below: 

NAME OF CONCERN: UNIVERSITE DE MONTREAL 

ADDRESS OF CONCERN: Postal Code 6 1 28 Station A 

Montreal, Quebec. H3C 3J7 CANADA 



I hereby declare that the above identified nonprofit organizatiOon qualifies as a nonprofit organization as 
defined in 37 CFR 1.9(e), for purposes of paying reduced fees under Section 41(a) and (b) of Title 35 United 
States Code, in that the above identified nonprofit organization is a university or other institution of higher 
education located in any country. 

I hereby declare that rights under contract or law have been conveyed to and remain with the small business 
concern identified above with regard to the invention entitled PROCESS FOR THE GENERATION 
OF OLIGONUCLEOTIDE LIBRARIES (OLs) REPRESENTATIVE OF GENOMES OR 
EXPRESSED mRNAs (cDNAs) AND USES THEREOF by inventor(s) described in 

[ ] the specification filed herewith 

[X] application serial no.: 09/869,891 , filed July 6, 2001 

[ ] patent no.: , issued 

If the rights held by the above identified nonprofit organization are not exclusive, each individual, 
concern or organization having rights to the invention is listed below* and no rights to the invention are held by 
any person, other than the inventor, who could not qualify as a small business concern under 37 CFR 1.9(d) or 
by any concern which would not qualify as a small business concern under 37 CFR 1.9(d) or a nonprofit 
organization under 37 CFR L9(e). 

*NOTE: Separate verified statements are required from each named person, concern or organization 
having rights to the invention averring to their status as small entities. (37 CFR 1.27) 
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FULL NAME: 



ADDRESS: 



[ 1 Individual [ ] Small Business Concern [X] Nonprofit Organization 



I acknowledge the duty to file, in this application or patent, notification of any change in status resulting 
in loss of entitlement to small entity status prior to paying, or at the time of paying, the earliest of the issue fee 
or any maintenance fee due after the date on which status as a small entity is no longer appropriate. (37 CFR 



I hereby declare that all statements made herein of my own knowledge are true and that all statements 
made on information and belief are believed to be true; and further that these statements are made with the 
knowledge that willful false statements and the like so made are punishable by fine or imprisonment, or both, 
under Section 1001 of Title 18 of the United States Code, and that such willful false statements may jeopardize 
the validity of the application, any patent issuing thereon, or any patent to which this verified statement is 
directed. 

NAME OF PERSON SIGMNG: Jean Yvon Timofhy 

TITLE OF PERSON OTHER THAN OWNER: Director Industrial Liaison Office and Grants 



ADDRESS OF PERSON SIGNING: Same as above 



L28(b)) 



SIGNATURE: 




DATE: October 22, 2001 
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